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Fast detection of communication ^patterns in distributed executi ons I 
Thomas Kunz, Michiel F. H. Seuren 

November 1997 Proceedings of the 1997 conference of the Centre for Advanced Studies 
on Collaborative research 

Full text available: ^ pdf(4.21 MB) Additional Information: full citation , abstract , references , index terms 

Understanding distributed applications is a tedious and difficult task. Visualizations based on 
process-time diagrams are often used to obtain a better understanding of the execution of 
the application. The visualization tool we use is Poet, an event tracer developed at the 
University of Waterloo. However, these diagrams are often very complex and do not provide 
the user with the desired overview of the application. In our experience, such tools display 
repeated occurrences of non-trivial commun ... 

Computing graphical queries over XML data 
Sara Comai, Ernesto Damiani, Piero Fraternali 

October 2001 ACM Transactions on Information Systems (TOIS), Volume 19 issue 4 

Additional Information: full citation , abstract , references , citings, index 
terms 

The rapid evolution of XML from a mere data exchange format to a universal syntax for 
encoding domain-specific information raises the need for new query languages specifically 
conceived to address the characteristics of XML. Such languages should be able not only to 
extract information from XML documents, but also to apply powerful transformation and 
restructuring operators, based on a well-defined semantics. Moreover, XML queries should 
be natural to write and understand, as nontechnical person ... 

Keywords: Document restructuring, graphical query languages, semantics 
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3 Technique for auto matically corr ecting words in t ext 
Karen Kukich 

December 1992 ACM Computing Surveys (CSUR), Volume 24 issue 4 

Additional Information: fu l l c i tat i o n, abstr ac t, references, citings, index 
terms, review 



Full text available: " g pdf(6.23 MB) 



Research aimed at correcting words in text has focused on three progressively more difficult 
problems:(l) nonword error detection; (2) isolated-word error correction; and (3) context- 
dependent work correction. In response to the first problem, efficient pattern-matching and 
n-gram analysis techniques have been developed for detecting strings that do not appear in 
a given word list. In response to the second problem, a variety of general and application- 
specific spelling cor ... 
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Keywords: n-gram analysis, Optical Character Recognition (OCR), context-dependent 
spelling correction, grammar checking, natural-language-processing models, neural net 
classifiers, spell checking, spelling error detection, spelling error patterns, statistical- 
language models, word recognition and correction 

4 Meta data management: Rondo: a programmi n g platform for generic model Q 
management 

Sergey Melnik, Erhard Rahm, Philip A. Bernstein 

June 2003 Proceedings of the 2003 ACM SIGMOD international conference on 
Management of data 

Full text available: *g pdf(39 2.38 K B) Additional Information: full citation, abstra ct, refe rences , jnde x terms 

Model management aims at reducing the amount of programming needed for the 
development of metadata-intensive applications. We present a first complete prototype of a 
generic model management system, in which high-level operators are used to manipulate 
models and mappings between models. We define the key conceptual structures: models, 
morphisms, and selectors, and describe their use and implementation. We specify the 
semantics of the known model-management operators applied to these structures, ... 



5 Natural language and voice output for relational data base systems Q 
P. B. Powell, P. Thompson 

January 1978 Proceedings of the 1978 annual conference - Volume 2 

Full text available: *Q pdf(874.02 KB) Additional Information: full citation , abstract , references , index terms 

A method to output answers and reasons in English and/or voice in a question-answering 
system is presented. The system for which the output is designed has a relational data 
base. A description of &pgr;-relations is given. A &pgr;-relation is one which has multiple 
values for the predicate or arguments of the relation. The natural language output uses 
predicate and function patterns for the &pgr; -relations. Pattern construction and pattern 
expansion into English sentences are des ... 

Keywords: Answer/reason extraction, MRPPS 3.0, Natural language output, Question- 
answering system, Relational data base, Voice output 

6 Draft Proposed: American National Standard— Graphical Kernel System Q 
Technical Committee X3H3 - Computer Graphics 

February 1984 ACM SIGGRAPH Computer Graphics, Volume 18 issue SI 
Full text available: ^pdf(16.07 MB) Additional Information: full citation 



7 PLl workshops: On-the-fly model checking from interval logic specifications Q 
Miguel J. Hornos, Manuel I. Capel 

December 2002 ACM SIGPLAN Notices, Volume 37 issue 12 

Full text available: ^pdf(1 60.25 KB) Additional Information: full citation , a bstract , references 

Future Interval Logic (FIL) and its intuitive graphical representation, Graphical Interval 
Logic (GIL), can be used as the formal description language of model checking tools to 
verify hardware and software systems. An interval clearly defines the temporal scope over 
which properties are evaluated. From interval formulas specifying the temporal behavior of 
a system we obtain their semantically equivalent Buchi automata, but in such a way that 
our algorithm can be integrated into an on-the-f ... 

8 Reduced distance routing in single-state shuffle-exchange interconnection networks Q 
Ziao-Nan Tan, Kenneth C. Sevcik 

May 1987 ACM SIGMETRICS Performance Evaluation Review , Proceedings of the 1987 
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ACM SIGMETRICS conference on Measurement and modeling of computer 
systems, Volume 15 Issue 1 
Full text available: ^pdf(1.57 MB) Additional Information: full cit ation , abstract, re ferences , index terms 

In multiprocessor architectures, it is frequently necessary to provide parallel communication 
among a potentially large number of processors and memories. Among the many 
interconnection schemes that have been proposed and analyzed, shuffle-exchange 
networks have received much attention due to their ability to allow a message to pass from 
any node to any other node in a number of steps that grows only logarithmically with the 
number of interconnected nodes (in the absence of contention) whil ... 



9 Search strategy and selection function for an inferential relational system 
Jack Minker 

March 1978 ACM Transactions on Database Systems (TODS), Volume 3 issue l 

Full text available-H! | P df(2.29MB) Additi0nal ,nformation: ful( citation ' references, citings, index 

^ terms 

An inferential relational system is one in which data in the system consists of both explicit 
facts and general axioms (or "views"). The general axioms are used together with the 
explicit facts to derive the facts that are implicit (virtual relations) within the system. A top- 
down algorithm, as used in artificial intelligence work, is described to develop inferences 
within the~system~." The top-down approach starts witrvthe~q~uery, a conjunction of relations, 
to be answered. Either ... 



Keywords: answer and reason extraction, heuristics, inference mechanism, logic, predicate 
calculus, relational databases, search strategy, selection function, top-down search, virtual 
relations 



1 0 O n a ccel eratin g pattern m a t ching for technology mapping Q 
Yusuke Matsunaga 

November 1998 Proceedings of the 1998 IEEE/ACM international conference on 
Computer-aided design 

Full text available: ^ pdf(506. 73 KB) Additional Information: full citation , references , citings , index terms 



11 Industry/g ove rnment track papers: Learning to detect malicious ex ecutables in the wild Q 
Jeremy Z. Kolter, Marcus A. Maloof 

August 2004 Proceedings of the 2004 ACM SIGKDD international conference on 
Knowledge discovery and data mining 

Full text available: "g| pdf(216.52 KB) Additional Information: full citation, abstract, references, index terms 

In this paper, we describe the development of a fielded application for detecting malicious 
executables in the wild. We gathered 1971 benign and 1651 malicious executables and 
encoded each as a training example using n-grams of byte codes as features. Such 
processing resulted in more than 255 million distinct n-grams. After selecting the most 
relevant n-grams for prediction, we evaluated a variety of inductive methods, including 
naive Bayes, decision trees, support vector machines, and boosting. ... 

Keywords: concept learning, data mining, malicious software, security 



12 Pat t ern Matching in Trees 

Christoph M. Hoffmann, Michael J. O'Donnell 

January 1982 Journal of the ACM (JACM), Volume 29 Issue l 

Full text available: ^ pdf(1 .45 MB) Additional Information: full citation , references , citings, index terms 
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Multi Relational Data Mining (MRDMV A survey of kernels for struc ture d data Q 
Thomas Gartner 

July 2003 ACM SIGKDD Explorations Newsletter, volume 5 issue 1 

Full text available: pdf(1.16 MB) Additional Information: full citation , abstract , references, citings 

Kernel methods in general and support vector machines in particular have been successful 
in various learning tasks on data represented in a single table. Much Veal-world' data, 
however, is structured - it has no natural representation in a single table. Usually, to apply 
kernel methods to 'real-world' data, extensive pre-processing is performed to embed the 
data into areal vector space and thus in a single table. This survey describes several 
approaches of defining positive definite kernels on ... 

Keywords: inductive logic programming, kernel methods, multi-relational data mining, 
structured data 



14 Ch aracteri z i n g the be ha vior of a program usi n g mu lti p l e-len gt h N- g rams Q 
Carla Marceau 

February 2001 Proceedings of the 2000 workshop on New security paradigms 

Full text available: ^pdf(107 MB) Additional Information: full c itatio n, references, citings , ind ex terms 



Keywords: computational immunology, finite automata, intrusion detection, string 
processing 



15 Movement in active production networks 
Mark A. Jones, Alan S. Driscoll 

July 1985 Proceedings of the 23rd conference on Association for Computational 
Linguistics 

Full text available: Wj pdf(412.46 KB) 

J=| Additional Information: full cttation. abstract, references 

W Publisher Site 

We describe how movement is handled in a class of computational devices called active 
production networks (APNs). The APN model is a parallel, activation-based framework that 
has been applied to other aspects of natural language processing. The model is briefly 
defined, the notation and mechanism for movement is explained, and then several 
examples are given which illustrate how various conditions on movement can naturally be 
explained in terms of limitations of the APN device. 

16 Si milarity-based retrieval for div ers e books h elf software re p osi t or y users 
Igor Jurisica 

November 1997 Proceedings of the 1997 conference of the Centre for Advanced Studies 
on Collaborative research 

Full text available: f| pdf(1 26.60 KB) Additional Information: full citation , abstract , ref erences , index terms 

The paper presents a similarity-based retrieval framework for a software repository that 
aids the process of maintaining, understanding, and migrating legacy software systems 
[12]. Designing a software repository involves three issues: (1) information content; (2) 
information representation; and (3) strategies for accessing repository artifacts. Assuming 
the architecture presented in [12] we extend the retrieval system to support imprecise 
queries, iterative browsing, and diverse users. Because o ... 

17 Short papers: Discovery of ads web hosts through traffic data analys is 
V. Bacarella, F. Giannotti, M. Nanni, D. Pedreschi 

June 2004 Proceedings of the 9th ACM SIGMOD workshop on Research issues in data 
mining and knowledge discovery 

Full text available: ^ pdfd 89.30 KB) Additional Information: full citation , abstract , refere nces 
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One of the most actual problems on web crawling — the most expensive task of any search 
engine, in terms of time and bandwidth consumption — is the detection of useless segments 
of Internet. In some cases such segments are purposely created to deceive the crawling 
engine while, in others, they simply do not contain any useful information. Currently, the 
typical approach to the problem consists in using a human-compiled blacklist of sites to 
avoid (e.g., advertising sites and web counter ... 



18 A systematic study of functional language implementations 
Remi Douence, Pascal Fradet 

March 1998 ACM Transactions on Programming Languages and Systems (TOPLAS), 

Volume 20 Issue 2 

Full text available: S Pdf(273.98 KB) Additional Information: full citation , abstract , references , index terms 

We introduce a unified framework to describe, relate, compare, and classify functional 
language implementations. The compilation process is expressed as a succession of 
program transformations in the common framework. At each step, different transformations 
model fundamental choices. A benefit of this approach is to structure and decompose the 
implementation process. The correctness proofs can be tackled independently for each step 
and amount to proving program transformations in the functi ... 

Keywords: abstract ma chin es, combinators, compilers, functional programming, program 
transformation 



19 Pattern matching for sets of segments 

Alon Efrat, Piotr Indyk, Suresh Venkatasubramanian 

January 2001 Proceedings of the twelfth annual ACM-SIAM symposium on Discrete 
algorithms 

Full text available: Q pdf(953.19 KB) Additional Information: fu ll citation , ab stract , references, index terms 

In this paper we present algorithms for a number of problems in geometric pattern 
matching where the input consist of a collections of segments in the plain. Our work 
consists of two main parts. In the first, we address problems and measures that relate to 
collections of orthogonal line segments in the plane. Such collections arise naturally from 
problems in mapping buildings and robot exploration. 



We propose a new measure of segment similarity called a coverage measure, and pr ... 

20 Hypermedia stru c t ur es and the division o f labor in meetin g roo m collaboration 
Gloria Mark, Jorg M. Haake, Norbert A. Streitz 

November 1996 Proceedings of the 1996 ACM conference on Computer supported 
cooperative work 

Full text available: ff] pdf(1.51 MB) Additional Information: full citation , references , citings , index terms 



Keywords: collaborative style, collaborative work, electronic meeting room, electronic 
whiteboard, empirical studies, group process, hypermedia user-interface, task division 
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21 Functional programming witli graphs 
Martin Erwig 

August 1997 ACM SIGPLAN Notices , Proceedings of the second ACM SIGPLAN 

international conference on Functional programming, Volume 32 issue 8 

Additional Information: full citation , abstract , references, citings, index 
terms 



Full text available: 



Graph algorithms expressed in functional languages often suffer from their inherited 
imperative, state-based style. In particular, this impedes formal program manipulation. We 
show how to model persistent graphs in functional languages by graph constructors. This 
provides a decompositional view of graphs which is very close to that of data types and 
leads to a "more fictional" formulation of graph algorithms. Graph constructors enable the 
definition of general fold operations for graphs. We pres ... 



22 Graphs and trees: Efficiently mining frequent trees in a forest 
Mohammed J. Zaki 

July 2002 Proceedings of the eighth ACM SIGKDD international conference on 
Knowledge discovery and data mining 

Additional Information: full citation, abstract, references, citings, index 
terms 



Full text available: fBpdf( 1.26 MB) 



Mining frequent trees is very useful in domains like bioinformatics, web mining, mining 
semistructured data, and so on. We formulate the problem of mining (embedded) subtrees 
in a forest of rooted, labeled, and ordered trees. We present TREEMINER, a novel algorithm 
to discover all frequent subtrees in a forest, using a new data structure called scope-list. 
We contrast TREEMINER with a pattern matching tree mining algorithm 
(PATTERN MATCHER). W ... 

23 Timing-driven hierarchical global routing with wire-sizing and buffer-insertion for VLSI Q 
with multi-routing-layer 

Takahiro Deguchi, Tetsushi Koide, Shin'ichi Wakabayashi 

January 2000 Proceedings of the 2000 conference on Asia South Pacific design 
automation 

Full text available: fTl pdf(192.18 KB) Additional Information: full citation , references , citings 



24 Pattern-based tree attribution 
Charles Farnum 

February 1992 Proceedings of the 19th ACM SIGPLAN-SIGACT symposium on Principles 
of programming languages 
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Full text available: *gj pdf(1.32 MB) Additional Information: full citation , abstrac t, references, citings, index 

terms 

Attribute grammars have been used for many language-oriented tasks, including the formal 
description of semantics and the implementation of compilation tasks from simple type 
checking through code generation. Despite their successful use, attribute grammars have 
some disadvantages, including the monolithic nature of the grammar and the fixed factoring 
of all attribute descriptions by a single set of grammar productions. Attribute pattern sets 
provide a more expressive attri ... 

25 An interpreter generator using tree pattern matching ! 
Christoph M. Hoffmann, Michael J. O'Donnell 

January 1979 Proceedings of the 6th ACM SIGACT-SIGPLAN symposium on Principles of 
programming languages 

Full text available: ffl pdf(852.20 KB) Additional Information: full citation , abstract , references , citings 



Equations provide a rich, intuitively understandable notation for describing nonprocedural 
computing languages such as LISP and Lucid. In this paper, we present techniques for 
automatically generating interpreters from equations, analagous to well-known techniques 
for generating parsers from context-free grammars. The interpreters so generated are 
exactly faithful to the simple traditional mathematical meaning of the equations-no lattice- 
theoretic or f ixpoi nt i dea s~a re n eed ed to exp la i n th e corr ~rr~ 

26 Technical papers: software architecture I: Design Pattern Rationale Graphs: linking 
design to source 

Elisa L A. Baniassad, Gail C. Murphy, Christa Schwanninger 

May 2003 Proceedings of the 25th international conference on Software engineering 

Full text available:^ ,„ A An |f| , , . . 

T§§£QI(L13 M B) ^ Additional Information: full citation, abstract, refe ren ces 

Publisher Site 

A developer attempting to evolve a system in which design patterns have been applied can 
benefit from knowing which code implements which design pattern. For instance, the 
developer may be able to understand the purpose, or to assess the flexibility of the code, 
more quickly. The degree to which the developer benefits depends upon their 
understanding of the pattern. Achieving an in-depth understanding of even a simple pattern 
can be difficult as pattern descriptions span several pages of text, a ... 

27 Qn the construction of efficient match networks 
Jack S. E. Tan, Jaideep Srivastava, Sashi Shekhar 

April 1992 Proceedings of the 1992 ACM/SIGAPP Symposium on Applied computing: 
technological challenges of the 1990's 

Full text available: "g| pdf(741.21 KB) Additional Information: full citation , references , index terms 



28 A theory of parameterized pattern matching: algorithms and ^BBiisattops Q 
Brenda S. Baker 

June 1993 Proceedings of the twenty-fifth annual ACM symposium on Theory of 
computing 

Full text available: | |pdf(1.27 MB) Additional Information: full citation , r eferences , citings, index terms 



29 Software evolution: Generating programming languag e-ba sMj^a^ Q 
Santanu Paul, Atul Prakash 

October 1993 Proceedings of the 1993 conference of the Centre for Advanced Studies 
on Collaborative research: software engineering - Volume 1 

Full text available: *g| pdf(1.55 MB) Additional Information: full citation , abstra ct, references 

This paper is based on a logical extension of our past work in pattern matching tools [22, 
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24, 25 ] for reverse engineering. We explore two new directions: first, we investigate the 
need for new and more powerful source code and pattern representations to support a 
richer set of queries; and second, we develop the concept of automatic generation of 
pattern matchers for different programming languages starting from a high-level 
specification of the programming language. A generator will eliminate ... 

30 Topology matching for fully automatic similarity estimation o f 3D shapes j 
Masaki Hilaga, Yoshihisa Shinagawa, Taku Kohmura, Tosiyasu L. Kunii 
August 2001 Proceedings of the 28th annual conference on Computer graphics and 
interactive techniques 

Full text available: f B Pdf(463.27 KB) Additional lnf °™ation: full citation, abstract, references , dt[ngs, index 
* terms 

There is a growing need to be able to accurately and efficiently search visual data sets, and 
in particular, 3D shape data sets. This paper proposes a novel technique, called Topology 
Matching, in which similarity between polyhedral models is quickly, accurately, and 
automatically calculated by comparing Multiresolutional Reeb Graphs (MRGs). The MRG thus 
operates well as a search key for 3D shape data sets. In particular, the MRG represents the 
skeletal and topological structure of a 3 ... 

Keywords: 3D sea rch^omputer vision, sha pe recognition 



31 An improvement to bottom-up tree pattern matching 
D. R. Chase 

October 1987 Proceedings of the 14th ACM SIGACT-SIGPLAN symposium on Principles 
of programming languages 

Full text available: fp pdf(954.02 KB) Additional Information: full citation , references , citings , index terms 



32 A gra ph-based framework for feature recognition Q 
Sashikumar Venkataraman, Milind Sohoni, Vinay Kulkarni 

May 2001 Proceedings of the sixth ACM symposium on Solid modeling and 
applications 

Full text available: ^| pdf(941 .79 KB) Additional Information: full citation, abstract, references, index terms 

This paper discusses a feature recognition system for recognizing User Defined Features 
(UDF). The feature recognizer uses a graph-based approach to represent and recognize 
features. An attributed face adjacency graph consisting of topological and geometric 
attributes is used to represent UDF's. The feature recognition step involves finding similar 
subgraphs in the part graph. The novelty of the framework lies in the usage of a rich set of 
attributes to recognize a wide range of features effi ... 

Keywords: attributed graphs, boundary representation, design tree, feature interactions, 
feature parameterization, feature suppression, feature-based design, graph grammars, 
user-defined features 
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June 2001 Proceedings of the 38th conference on Design automation 

Full text available: ||| pdf(231 .52 KB) Additional Information: full citation, a bstra ct, refere nces, inde x terms 

In this paper, a new structural matching algorithm for technology mapping is proposed. The 
algorithm is based on a key observation that the matches for a node in a subject Boolean 
network are related to the matches for its children. The structural relationships between the 
library cells are modeled using a lookup table. The proposed method is fast, has low 
memory usage, and is easy to implement. Experimental results show speedups of 20x over 
Matsunaga's fast mapping approach, and orders of ... 
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January 1971 Proceedings of the 1971 26th annual conference 

Full text available: g pdf(945.96 KB) Additional Information: full citation , abstract , references , index terms 

The design of a pattern manipulation language, PIRL, is described here. PIRL can handle 
arbitrary, oriented patterns (i. e., subgraphs) of lists, nodes, numeric and Hollerith data on 
many levels of abstraction in a concise, legible manner. Patterns and lists may be inserted, 
retrieved, deleted, indexed, compared, named, intersected, united, and complemented. 
Pattern names may be referenced to a lower level of abstraction and pattern forms may be 
quantified. PIRL should be of considerable va ... 

Keywords: Association, Graph, Information retrieval, List attribute, Pattern, Programming 
language 
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Keywords: co-design, design space exploration, instruction set synthesis, pattern 
matching 
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Keywords: linear concurrent programming, programming languages for distributed 
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Most work in the ad hoc music retrieval field has focused on the retrieval of monophonic 
documents using monophonic queries. Polyphony adds considerably more complexity. We 
present a method by which polyphonic music documents may be retrieved by polyphonic 
music queries. A new harmonic description technique is given, wherein the information from 
all chords, rather than the most significant chord, is used. This description is then combined 
in a new and unique way with Markov statistical methods t ... 
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Tree pattern matching is a fundamental operation that is used in a number of programming 
tasks such as mechanical theorem proving, term rewriting, symbolic computation, and 
nonprocedural programming languages. In this paper, we present new sequential 
algorithms for nonlinear pattern matching in trees. Our algorithm improves upon know tree 
pattern matching algorithms in important aspects such as time performance, ease of 
integration with several reduction strategies and ability to avoid unne ... 

Keywords: nonlinear pattern matching, normalization, rewriting, theorem proving 

40 S pe ci al session on software s ys tems #1 : GXL: a new g raph tran s formati on lan g ua ge Q 
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GXL - the graph transformation language presented in this abstract is a programmable 
graph rewriting language that combines ideas from current tree rewriting technology, 
exemplified by the TXL language, and general graph rewriting systems. From TXL it inherits 
the idea of scoping (limitation of a transformation rule to a particular sub tree or sub graph) 
and parameterization (working with multiple independent copies of a subtree or subgraph). 
From graph rewriting it inherits the generalization t ... 

Keywords: graph grammars, graph rewriting, tree rewriting 
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